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AN APPLICATION OF FOURIER'S SERIES TO PROBABILITY. 

By David F. Babhow. 
Introduction. 

Many persons have been interested in compounding frequencies. 
They all seem to aim at asymptotic expressions which represent approxi- 
mately the functions obtained by compounding a large number of fre- 
quencies. The most general theorem of this sort is to the effect that if a 
great many frequencies be compounded the result approaches a Gaussian 
error function.* The generality and beauty of this theorem, and the 
fact that in most applications a great many frequencies enter, seem to 
have led writers to give little attention to those cases where only a few 
frequencies enter. Furthermore in those cases where exact formulas for 
compounding a few frequencies have been given, great difficulty is found 
in obtaining numerical results. Even Laplace confesses that in one case 
such a difficulty arrested him for a long time before he found a suitable 
approximation. 

This article gives a simple method of compounding frequency functions 
when they can be represented by Fourier's series. It has the advantages 
of being applicable over a wide range of problems and of yielding formulas 
which represent exactly the frequencies and from which numerical results 
can be obtained with relative ease. Owing to the existence of asymptotic 
formulas, it is most useful where a few frequencies are to be compounded, 
that is where the asymptotic solution is not very exact. 

In section 1 the general method is set forth. In section 2 a particular 
problem is solved. The solution looks so unlike the asymptotic solution 
that it seems of interest to compare the two directly to get an upper limit 
on the discrepancy. This is done in sections 4 and 5, and also a way is 
pointed out for comparing the exact and asymptotic solutions in various 
other cases. 

1. Fundamental theorems. 

In many probability problems we consider a variable t which may 
take on any one of an infinite number of values, for instance any value in 
some interval, finite or infinite. Of course in such a case the probability 

* For an exact statement and a simple proof of this theorem see Crofton, Phil. Trans., vol. 
160, p. 175. 
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that any particular value will be assumed is infinitesimal, but it is often 
possible to define a function fix), called the frequency of t, such that the 
integral of fix) from any value a to any larger value b gives the probability 
that t will assume a value in the interval (a, b) . Evidently f(x) must be 
everywhere positive or zero, and the integral of fix) from — oo to + co 
must be unity since this is the symbol for certainty. 

Lemma. // fix) is the frequency of t, and <pix) is the frequency of r, 
and Fix) is the frequency of t + r; then 

Fix) = f fix - X)^(X)dX = f firfvix - M )d/x. 

Proof. The probability that t + t will fall between x and x -\- Ax 
may be expressed in two ways. First it is by definition the integral of 
Fi\) from x to x + Ax, and secondly it is the double integral of /(^t)^(X) 
extended over that region of the X, /i-plane which lies between the two 
lines X + n = x and \ + p. = x + Ax. Equating these two expressions 
we have 

f*x+Ax /»oo /*x+Ax— A. /»oo f*x+Ax— /* 

F(A)dX = v(\)f(ji)diid\ = ^(X)/0i)dXd/i. 

%/z J—vt *Jx— A *-'— oo *Jx~v. 

Dividing by Ax and taking the limit for Ax = we obtain the desired 
result. 

Theorem 1. If 

iwx s 



and 
and 



fix) = \ a + £ ( «i cos -jr + &» sin 

p(a;) = 5« + 2_, I «» cos -~- + ft sin -~- I 

r./ n i ^ , -^ f A i TX , r> ^ xX N 

/^(a;) = |A + 2-, ( Ai cos -?r + -»» sin -~- 



represent the frequencies of t, r, and t + r respectively in the interval ( — C, 
C), while outside this interval all three functions are identically zero; then 

A = Ca ao, 

At = Ciaioti — 6ift), 

Bi = Ciatfi + bicu) ii = 1, 2, 3, • • •)• 

Proof. In applying the lemma to calculate Ffcc) from/(a;) and p(a;), 
the limits of integration may be reduced to — C and C, since outside this 
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interval the integrand is identically zero. Now 

f{x — X) = §ao + X, «» cos -77 (x — X) + fc,- sin ^- (a; — X) 



, , ^ T ivx *VX , . ixa; . ixX 

§«o + 2j I °» cos ~TT cos -79- -f- a» sin -77- sin -77 



-77- cus — Ty r u.» siii ^, sin -77- 



, , iwx iw\ 7 iwx . iirX 1 

+- b{ sin -77- cos -7, o,- cos -77- sin -77- . 



,-y OU6 *"y 1/ j t/'-'Q ^y 



To obtain F(x) we multiply this series, term by term, by the series repre- 
senting <p(X), and integrate with respect to X from — C to C* On 
account of the orthogonal property of the sines and cosines involved a 
great many of the resulting terms are zero. The coefficients of the 
remaining terms give us the above values for A and B in terms of a, b, 
a and (3. 

We have slurred over one difficulty, namely that when we represent 
the function fix — X) by the Fourier's series where x and X can each 
range over the interval (— C, C), the value of the argument x — X may 
lie outside this interval and for such values the series may fail to repre- 
sent f{x — X) correctly. We shall show that whenever this occurs the 
series representing tp(\) vanishes so that the result of the integration is 
not vitiated. Let (di, d 2 ) and (5 1} 5 2 ) be the smallest intervals outside 
of which f(x) and <p(x) respectively vanish identically. Then the corre- 
sponding interval for F{x) will be (di + Si, d 2 + 5 2 ). Now the interval 
(— C, C) must by hypothesis include these three intervals. Hence the 
series for fix) must converge to zero in the two intervals ( — C, di) and 
{di, C), and since it is periodic it will converge to zero for a distance 
beyond the ends of ( — C, C) and hence will represent f{x) in the wider 
interval (— 2C + d 2 , 2C + di). We need, therefore, look at only two 
cases: x — X < — 2C + d 2 , and x — X > 2C + d\. Take the first case, 
then 

x - X < - 2C + d 2 ; 
but - x < C 

and d 2 + 8 2 < C 



Adding S 2 — X < or X > 5 2 . 

Hence X must lie between 5 2 and C, and the Fourier's series for <p(X) 
will converge to zero. We find similarly for the second case that X 
must lie between — C and 5i, so that the series for <p(X) is zero in this 
case also. 



* The uniform convergence of Fourier's series is sufficient to permit this operation, cf . Picard, 
Traite d'Analyse, Vol. 1, chapter on trigonometric series. 
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The results of theorem 1 may be put into the following compact form 
by defining b , = /3 = 0, and making use of the imaginary unit V— 1: 

Ai + BiJ^l = C(flt + 6,-V^l)(ai + frJ=l) (i = 0, 1, 2, • • •). 

Equating real and imaginary parts gives the formulas. From this form 
it is evident that to find the frequency of the sum of n variables we need 
only multiply the n parentheses together and prefix the factor O n_1 . 
Then the real and imaginary parts of the product give the coefficients in 
the Fourier's series for the frequency of the sum. 

An alternate form of theorem 1 is 

Theorem 2. If 

f(x) = %r + £>,- cos l~r - gi), 

<p(x) = |/0o + Jl Pi cos ( -g — Ji ) , 

F(x) = iflc + £ Ri cos {j? - <?.-) , 

represent the frequencies of t, t, and t + t respectively in the interval ( — C, 
C) and outside this interval all three functions are zero, then 

Ri = Cr iPi (i = 0, 1, 2, • • •), 

Gi = g t + ji (i = 1, 2, 3, • • •)• 

An interesting special case occurs when the three functions are even 
in addition to satisfying the conditions named in theorem 1, for then 
each can be expressed in a series of cosines alone, causing obvious simpli- 
fications in our formulas. 

Not only can theorem 1 be used to find the frequency for the sum of 
several variables whose frequencies are known, but it may also be used 
to reverse the process. For example if the frequency for the sum of two 
variables is given, and that for one of the variables, then the frequency 
for the other may be found. Again, if an arbitrary frequency function 
be given, it may be possible in various ways to assign two or more variables 
with suitable frequencies so that their sum shall have the given frequency. 
The writer hopes to take up this question later. 

2. Solution of a problem. 

As an application of the general method let us solve the following 
problem. 

" There are n numbers each lying between — j and + \, such that 
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any value of each between these limits is equally probable. What is the 
probability that their sum will lie between s — \ and s + \, ■••?"* 

Let f(x, n) denote the frequency of the sum of n such numbers (or 
variables). Then/(x, 1) is identically zero outside the interval (— \, §), 
since the variable whose frequency it is cannot occur outside this interval. 
Furthermore all values between — \ and + § are equally probable, so 
that the integral of f(x, 1) over any subinterval thereof should be pro- 
portional to the length of that subinterval. Hence f(x, 1) must equal a 
constant in this interval, and this constant must be unity in order that 
the integral of f(x, 1) from — | to + ^ be equal to unity, the symbol for 
certainty. Thus f(x, 1) is an even function and may be developed into 
the following series of cosines: 

., 1 . 1 . 2f 1 . » iirX 

f(z>V=2C + rEi Sm 2C COS ir- 

The frequency for the sum of n such variables is easily found, by suc- 
cessive applications of Theorem 1, to be 



(1) f(x, n)=zc + — 5-gpam- [^ ) Cos 



ITTX 



Obviously if C be taken equal to n/2 it will be large enough to include all 
points where the function does not vanish identically. It may be taken 
larger without damaging the formula, in which case the series merely 
converges to zero in the intervals (n/2, C) and (— C, — (n/2)). In fact, 
on account of the periodicity of the cosines, the series converges to zero 
for a distance (C — (n/2)) beyond the ends of the interval (— C, (7), and 
therefore represents correctly f(x, n) when x lies in the interval ((n/2) 
— 2(7, 2(7 — (n/2)). Now this last statement holds true even when C has 
a value between n/4 and n/2, but the proof is too detailed to be given 
here. The fact is only of importance in calculations, as we shall wish 
to take C as small as possible in order to obtain a rapidly converging series. 
A check calculation may be made by changing the value of C. 

Now let f(s, n) denote the function which furnishes the solution of 
our problem. Then 



+(s, n) = | /(*, n)dx = 2£ + ~^r g *****" (^ ) 



^7^s 
cos -p- . 



* The problem was stated in this form by Prof. E. W. Brown in The American Math. Monthly, 
Nov., 1916, p. 341. It is a special case of a problem solved by Laplace in his Theorie Analytique 
des Probability (Paris, 1820), pp. 253-256. He finds a formula unsuitable for calculation, and 
so resorts to an approximate formula which is practically the asymptotic Gaussian error function 
which we derive in section 4. See collected works of Laplace, Vol. 12, pp. 301 ff. 
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The series represents ^(s, n) if (n + l)/2 - 2C =§ s < 2C - (n + l)/2. 
The following table for n = 100 was calculated using C = 40 and checked 
(except the last entry) using C = 30. 



s 





l 


2 


3 


4 


5 


6 


7 


8 


9 


10 


MS, 100) 


13731 


.12943 


.10841 


.08066 


.05328 


.03124 


.01624 


.00748 


.00305 


.00110 


.00035 



For negative values of s the values of <p will be the same as for positive 
values. The calculation is rather tedious as fifteen or twenty terms of 
the series have to be used. If no great accuracy is required, it is possible 
to use in this, and, in many other cases, certain well-known asymptotic 
formulas. They are taken up in the last section. 

3. A Fourier's series approximating the Gaussian error curve. 

The general theory of compounding frequencies leads us to suspect 
that a Gaussian error function would give a good approximation to the 
frequency in the problem of the last section and similar ones. The 
results of this section are developed chiefly for the purpose of taking this 
point up in the next section, yet they present some interest of their own. 
If we develop the function a/ Vir-e"" 02 ^ in a Fourier's series in the interval 
(— C, C), we obtain a series of cosines alone, since the function is even. 
The ith. coefficient is 

_ ?_ f c ~ -« izb 



■d\. 



This cannot be integrated in closed form. We therefore adopt an approxi- 
mation by extending the upper limit to infinity, thereby making an error 
in each coefficient, the total effect of which is to be examined presently. 
Denoting by a» this approximate value of the ith coefficient, we have* 



a,- 



e- a ^cos l -~d\ 






2 r_o_ 

(-> Jo ~\ir 
Hence the approximate development of aj 4Tr-e~ a * x * is 

(2) 



2C + C U 



£ e _ (42ltWC2 , cos ___ 



ITTX 

~c~ 



Now let us denote by e(x) the error made by substituting this series 
for the Gaussian error function. Then 



is J a v7T »=i L ^ Jc \jr 



e aK cos -~r d\ cos -~- 



* For this integration see B. O. Pierce, A Short Table of Integrals, p. 64, Formula 508. 
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C \7T >=1 L Jc ^ 

+ I e ** cos -jr d\ + ■ ■ ■ cos -jr • 

Next change the variables of integration as follows : In the first inte- 
gral of each term of the sum, let X = 2C — n, in the second integral of 
each term of the sum, let X = 2C + ix, in the third integral of each term 
of the sum, let X = 4C — n, in the fourth integral of each term of the 
sum, let X = 4C + y., and so on regularly. This is designed to reduce 
each set of limits to zero and C without introducing troublesome nega- 
tive signs. After slight trigonometric reductions and judicious changes 
of limits we obtain 



e(x) = -— rTe- s!(2C ~" )2 + e -" 2(2C+ ^ 2 + • • • 1 



dfi 



+ tIt= £ r\e- aVC -^~ + e -^ lc+ »->"- + ■ ■ ■ 1 cos ^%-cos 



lirX 



But we recognize this as the Fourier's series which represents that func- 
tion of x obtained by substituting x for fj, in the bracket. Hence 

Thus we have found not merely an upper limit on the error but the 
exact error for any value of x, given as a function of x. Since x must 
lie in the interval (— C, C) this error is very small even for moderately 
large values of a and C. Its greatest value occurs when x = C, and 
its smallest value when x = 0. The property most valuable to us in 
what follows is that for a fixed value of a the error approaches zero if C 
is allowed to increase indefinitely. 

It is rather curious that the Fourier's series (2) is equal to a/ -Jr ■ e~ a °~ x " 
+ e(x) not only in the interval ( — C, C) but for all values of x, since the 
latter function is periodic with a period of 2C. 

4. Asymptotic formulas for some frequencies. 

Let us compare coefficients in series (1) and (2) for the purpose of 
determining a value of a which shall make the Gaussian error function 
approximately represent the frequency in the problem of section 2. For 
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this we shall use the following more or less obvious formulas: 

ft i/i 



e 



-(»«/6) — i _ V- _i_ y 

1 6 ^72 



sin y _ y 2 y i 

y 6 + 120 



w le j U / = 72 ie j = 100(n - l) 2 • 

The last inequality is obtained by taking the maximum of the preceding 
function of y. 

Now write down the difference between the ith coefficients in series 
(2) and (1), changing their forms slightly to facilitate comparison with 
formula (3) . This gives 



L r e -l/6(i>r/2C)2l6/o 2 _ _ 



sin 



&)j 



If we take a to satisfy 6 /a 2 = n, or a = V6/n, then supposing n to be 
rather large, say twenty or more, we can simplify the maximum difference 
given by (3) and say that the maximum difference between any two 
corresponding coefficients in series (2) and (1) does not exceed 3/10 Cn. 
Next let k denote the smallest integer greater than or equal to 2C 
and consider the remainder of series (1) after the kth term. This remain- 
der is obviously less than the following 

The remainder of series (2) after k terms is still smaller. Now the average 
difference between corresponding coefficients in the first k terms in 
series (2) and (1) will not exceed one half the maximum, as is seen from 
(3), so that in the sum of these terms we cannot have an error greater 
than 3/10 n. This limit is extravagant enough to cover the error made 
in neglecting terms beyond the &th. Also the error discussed in section (3) 
can be made as small as we please by taking C large. Hence if n is not 
less than twenty in the problem of section (2), the Gaussian error function 
with a = V6/n approximates the frequency, the discrepancy being less than 
3/10 n. In actual computation the discrepancy seems to be much less. 
For the case n = 100 it is about 1/100 n. 
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As n grows infinite every ordinate in the graphs of both the frequency 
function and the Gaussian function approaches zero, so that it is evident 
a priori that the difference between corresponding ordinates must approach 
zero. This might seem to rob our result of all significance; but it will be 
noticed that the greatest ordinate in the Gaussian curve is V6/nx so 
that the difference between corresponding ordinates near the origin is an 
infinitesimal of higher order than either of the ordinates, as n grows 
infinite. 

Let us take up another problem, an extension of the previous. Given 
n variables each lying between — %l and + \l, such that every value of 
each in this interval is equally probable. Find a Gaussian error function 
approximating the frequency of their sum. A similar course of reasoning 
shows that the Gaussian function with a = V6/«l 2 gives an approxima- 
tion to the frequency whose greatest discrepancy is less than 3/10 til. 
Let us vary the problem by supposing n to increase and I to diminish in 
such a way that nl 2 remains equal to some arbitrary constant. The 
approximating Gaussian function would not change, but the discrepancy 
would approach zero. Evidently a problem of this sort could be framed 
whose frequency curve closely resembled any arbitrary Gaussian error 
curve. 

Next we shall consider the sum of n variables each having the same 
frequency f(x), which shall be identically zero outside a certain interval 
(— I, I), and capable of development in a Fourier's series of cosines in 
any larger interval ( — C, C). Then the frequency of the sum is approxi- 
mated by the Gaussian error function in which a is determined by 

(4) i- 2 =4n f\ 2 /(X)dX, 

and the approximation becomes better as n increases. 
A sketch* of the proof is as follows: We have 

f{x) = 2Q + 2-, a,i cos -£- . 

The constant term is known since f(x) is a frequency function and there- 
fore its integral from — C to C is unity. The frequency for the sum of 
the n variables is, by application of Theorem 1, 

(5) 2C + Cn ~ X % ai " cos "77" • 



* Bessel has obtained this result and the following one; Astr. Nachr., vol. 15, pp. 369^405. 
Our only excuse for giving them here is that the use of Fourier's series in this connection seems new 
and interesting. 
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Now we know that a,- is less than Aji, where A is some constant.* 
Consequently the remainder beyond a certain term in series (5) will be 
very small. To show that the earlier terms in series (5) and (2) are 
nearly equal when a is chosen as above, we exhibit a* as follows: 

a> = 79 I /(\) cos Yrd\ = tj I fW cos-^- aX 



2 

C\ 



The last form is obtained by developing cos iirX/C into a power series. 
Now if we develop the ith coefficient in (2) into a series of powers of the 
exponent of e, and compare with the above, it will be seen that we have 
so chosen a in (4) that the coefficients of (5) and (2) are nearly equal 
for small values of iw/C, that is for the earlier terms of the two series. 
The proof can be made rigorous. 

A further generalization of the problem is to consider n variables each 
with a different frequency function, fi(x), fz(x), ■ ■ •, f n (x). If each of 
these functions is zero outside of an interval (— I, I) and capable of 
development in a Fourier's series of cosines, then the Gaussian error 
function in which a is determined by 



1 4± f\%(\)d\ 



will approximate the frequency in an asymptotic manner. 

The question of asymptotic formulas when frequencies are compounded 

which are not even functions has been investigated by other methods. It 

looks as if theorem 1 might be used to establish these results also, but 

the process would probably be too tedious for interest. 

Sheffield Scientific School, 
February 2, 1917. 

* Picard, loc. cit., p. 253. 



